ABSTRACT



An inner product computing unit computes inner prod- 
ucts of an input pattern whose category is unknown, 
and orthogonalized dictionary sets of a plurality of 
reference patterns whose categories are known. A non- 
linear converting unit nonlinearly converts the inner 
products in accordance with a positive-negative sym- 
metrical nonlinear function. A neural network unit or a 
statistical discriminant function computing unit per- 
forms predetermined computations of the nonlinearly 
converted values on the basis of preset coefficients in 
units of categories using a neural network or a statistical 
discriminant function. A determining section compares 
values calculated in units of categories using the preset 
coefficients with each other to discriminate a category 
to which the input pattern belongs. 

4 Claims, 6 Drawing Sheets 
PATTERN RECOGNITION SYSTEM AND
METHOD USING NEURAL NETWORK

This application is a continuation of application Ser.
No. 07/447,405, filed on Dec. 7, 1989, now abandoned.

BACKGROUND OF THE INVENTION

1. Field of the Invention
The present invention relates to a pattern recognition
system and method, which can accurately recognize an
input pattern such as speech, character, figure, and the
like and, more particularly, to an improvement in a
partial space method.

2. Description of the Related Art
In recent years, studies about pattern recognition
processing for speech, character, figure, and the like
have been made, and the pattern recognition processing
has been receiving a great deal of attention as an
important technique for realizing a natural man-machine
interface.

A pattern recognition apparatus for executing pattern
recognition processing of this type basically has an
arrangement shown in FIG. 1. The respective units
perform the following processing.

1) A feature extracting unit 1 analyzes an input pattern
to obtain its feature pattern.

2) By referring to a reference pattern memory 2
which stores reference patterns obtained in units of
categories of patterns to be recognized as dictionaries,
similarities or distances between the reference pattern
dictionaries and the feature pattern are computed by a
pattern matching unit 3.

3) The collation results are determined by a determining
unit 4 to obtain a recognition result of the input
pattern. Note that the determining unit 4 obtains a
category name of the reference pattern which has the
highest similarity value (or the smallest distance value)
with the input pattern as a recognition result or a
recognition candidate for the input pattern. When the input
pattern is a speech waveform, the feature extracting unit 1
performs Band Pass Filter (BPF) analysis or Linear
Prediction Coding (LPC) analysis of the input speech, and
then detects a speech interval, thus obtaining acoustic
analysis data in this speech interval as an input pattern.

When the input pattern is a character image, the
feature extracting unit 1 quantizes the input character
image, and then extracts a character portion, thus
obtaining feature data of the character pattern as an input
pattern.

As a method in the pattern recognition processing, a
subspace method is known. Pattern recognition using
the subspace method is described in, e.g., U.S. Pat. No.
4,624,011 (Watanabe et al.).

In the subspace method, as reference pattern
dictionaries of categories, dictionaries (φ(K,m); K is category
name, and m is the number of an orthogonal axis; m = 1,
2, . . . , M) which are orthogonalized in advance by KL
expansion in units of categories are created as
orthogonalized dictionary sets. Similarities S(K) between the
dictionaries and an input pattern (F) are computed
according to the following equation to perform pattern
matching processing:

S(K) = \sum_{m=1}^{M} \frac{(F \cdot \phi(K,m))^2}{\|F\|^2} \qquad (1)

where (·) indicates an inner product, and || || indicates
a norm.

The pattern matching method according to the subspace
method is widely used in pattern recognition
since it can relatively easily obtain an accurate
recognition result.

In the conventional pattern recognition processing
using the subspace method, as shown in the abovementioned
equation, the inner products of the input pattern
(F) and the orthogonal axes (φ(K,m)) of the orthogonalized
dictionary sets are merely accumulated, and the
overall feature of the input pattern is evaluated using
the accumulation value. In other words, the pattern
recognition is performed not by using the individual
inner products obtained between the input pattern and
the orthogonal axes but by using the accumulation
value representing the overall feature. For this reason,
when an inner product with respect to a given orthogonal
axis takes a large value, which cannot be obtained
with a correct pattern, due to noise, the accumulation
result of the inner products tends to become a value
larger than that of the inner products for the correct
pattern. When pattern matching processing is performed
using the subspace method of this type, a determination
error (recognition error) caused by a category
other than an object to be recognized and various noise
components tends to occur.

In order to solve this problem, recently, pattern
matching processing using a multi-layer neural network
is receiving a great deal of attention. A nonlinear
discriminant function is realized by the neural network, for
sequentially transmitting neurons carrying information,
to extract feature data of an input pattern. The neural
network of this type poses the problem of how to determine
a set of coefficients for defining the way of combining
outputs from lower-order layers. As an algorithm
for obtaining these coefficients, a back propagation
(BP) algorithm is known (NATURE Vol. 323, 9,
pp. 533-536 (1986, Oct.) Learning representations by
back-propagation errors). Various reports announced
that pattern matching processing could be accurately
executed upon application of the BP algorithm.

However, when a neural network is used, an enormous
amount of computations for determining coefficients
of each layer must be performed, and a load of
computation processing is very large.

SUMMARY OF THE INVENTION

It is an object of the present invention to provide a
pattern recognition system and method which can reduce
a load on computation processing by utilizing
advantages of a neural network.

In order to achieve the above object, according to a
first aspect of the present invention, a pattern recognition
system comprises: orthogonalized dictionary means
for storing a plurality of reference patterns whose
categories are known; inner product computing means for
computing inner products of an input pattern whose
category is unknown and the plurality of reference
patterns which are stored in the orthogonalized dictionary
means and whose categories are known; converting
means for nonlinearly converting the inner products
which are computed by the inner product computing
means, in accordance with a positive-negative
symmetrical nonlinear function; evaluation value computing
means for computing evaluation values on the basis
of the nonlinearly converted values by the converting
means and coefficients which are preset in units of
categories; comparing means for comparing the evaluation
values obtained by the evaluation value computing
means in units of categories with each other; and
discriminating means for discriminating a category to
which the input pattern belongs on the basis of the
comparison results of the comparing means.

According to a second aspect of the present invention,
a pattern recognition method comprises the steps
performed by a computer of: computing inner products
of an input pattern whose category is unknown and
orthogonalized dictionary sets of a plurality of reference
patterns whose categories are known; nonlinearly
converting the inner products, which are computed in
the computing step, in accordance with a positive-negative
symmetrical nonlinear function; computing evaluation
values on the basis of the nonlinearly converted
values by the converting step and coefficients which are
preset in units of categories; comparing the evaluation
values obtained by the evaluation value computing step
in units of categories with each other; and discriminating
a category to which the input pattern belongs on the
basis of the comparison results of the comparing step.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a typical pattern recognition
apparatus;

FIGS. 2A through 2D are charts showing examples of
eigenvectors obtained by a KL expansion;

FIG. 3 is a chart showing a nonlinear function normally
used in a neural network;

FIG. 4 is a chart showing a positive-negative symmetrical
nonlinear function introduced in the present
invention;

FIG. 5 is a block diagram showing the first embodiment
of a pattern recognition system according to the
present invention;

FIG. 6 shows a neural network of the pattern recognition
system shown in FIG. 5; and

FIG. 7 is a block diagram showing the second embodiment
of the present invention.

DETAILED DESCRIPTION OF THE
PREFERRED EMBODIMENTS

According to a pattern recognition system and
method of the present invention, inner products of an
input pattern and orthogonalized dictionary sets are
nonlinearly converted by a positive-negative symmetrical
nonlinear function, and the converted values are
subjected to predetermined computation processing
using predetermined coefficients in units of categories.
As a result, a variation in input pattern can be effectively
absorbed by simple computation processing, and
the input pattern can be accurately recognized.

For example, eigenvectors of a speech pattern are
obtained as shown in FIGS. 2A through 2D. Eigenvectors
φ1, φ2, φ3, and φ4 of first through fourth axes
shown in FIGS. 2A through 2D are those of orthogonalized
dictionary patterns (eigenvectors) obtained up
to a tenth axis from speech patterns of a monosyllable
speech /cho/ ([tʃo]) pronounced by 300 speakers by the
KL expansion. In FIGS. 2A through 2D, • represents a
positive vector value, and ® represents a negative vector
value.

The eigenvector φ1 of the first axis expresses a feature
(vector) of an average input pattern. However, since a
plosive portion of /cho/ ([tʃo]) is obscured, the input
pattern cannot be recognized to be distinguished from
/sho/ ([ʃo]). The eigenvector φ2 of the second axis
mainly expresses a displacement in a frequency direction,
and the eigenvector φ3 of the third axis mainly
expresses a plosive portion of an affricate /cho/ ([tʃo]).
Therefore, when the eigenvector of the third axis is
referred to, the input pattern can be recognized to be
distinguished from /sho/ ([ʃo]) described above. Furthermore,
the eigenvector φ4 of the fourth axis mainly
expresses a displacement in a time direction.

In this example, eigenvectors of fifth and subsequent
axes are not illustrated. However, information carried
by each eigenvector is not as clear as that of the
eigenvectors of the first through fourth axes.

In this manner, the eigenvectors φ1, φ2, φ3, and φ4
(orthogonalized patterns) constituting an orthogonalized
dictionary obtained by the KL expansion well
express a variation in a speech (or character, or figure)
pattern. This resembles feature extraction in lower-order
layers in a multi-layer neural network.

Therefore, based on this point of view, a process of
computing an inner product of an input pattern and an
orthogonal vector is considered as a first (lowermost)
layer in the neural network in the present invention, and
a neural network for receiving the outputs from the first
layer is constituted to execute pattern matching processing.
As a result, coefficient computations of inputs
of a number of dimensions at the lowermost layer
can be omitted. More specifically, the first layer portion
is separated from the BP algorithm, and only a higher-order
layer portion is independently computed.

As a result, in the neural network, the number of
layers for which coefficients are computed by the BP
algorithm can be reduced to 1 or 2. Therefore, pattern
matching processing with a considerably reduced computation
volume can be realized.

A process of combining outputs from a lower-order
layer by a nonlinear conversion is provided between
adjacent layers of the neural network. As a function of
executing the nonlinear conversion, a sigmoid function
expressed by the following formula (FIG. 3) is often
used:

f(x) = \frac{1}{1 + e^{-x}}

where x is the total sum of inputs. The inner products of
the input pattern and the orthogonal vectors (eigenvectors)
take positive/negative values, and a magnitude of the
absolute value of each inner product represents a degree
of contribution to the corresponding axis. Therefore,
there is a new problem that the nonlinear function such
as the sigmoid function cannot be directly applied.

In the present invention, a positive-negative symmetrical
nonlinear function shown in FIG. 4 is adopted to
perform nonlinear conversion of the inner products.

In FIG. 4, a function A is a positive-negative symmetrical
function to yield characteristics similar to the sigmoid
function within the range of |x| ≤ 1, and is given
by:

[equation for function A not legible in source]

A function B shown in FIG. 4 is a quadratic function
given by:

B(x) = x^2
Furthermore, a function C exhibits linear characteristics 
in positive and negative regions, and is given by: 

[equation for function C not legible in source]

The inner products of the input pattern and the orthogonal
vectors are nonlinearly converted using the
positive-negative symmetrical nonlinear functions A, B, and
C, thus obtaining a means for transferring the outputs to
the higher-order layers of the neural network. Therefore,
high-performance pattern recognition processing
can be realized by a combination of the subspace
method and the neural network, each of which
compensates for the drawbacks of the other.

When the nonlinear conversion described above is
performed in practice, the absolute values of inner
products of an input pattern and orthogonal vectors are
computed, and a positive-side nonlinear conversion
shown in FIG. 4 may be performed for the absolute
values.
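
As a minimal sketch of the conversion just described, the following Python code takes the absolute values of the inner products and then applies a positive-side function. Function B (the square) is given in the text; the absolute-value form used here for function C, the function names, and the sample values are assumptions made only for illustration.

```python
import numpy as np


def func_b(x):
    """Quadratic function B(x) = x**2, as given in the text."""
    return np.asarray(x, dtype=float) ** 2


def func_c(x):
    """Assumed form of function C: linear in |x| (not confirmed by the text)."""
    return np.abs(np.asarray(x, dtype=float))


def symmetric_convert(inner_products, positive_side=func_b):
    """Apply a positive-side conversion to the absolute values, so that
    inner products of equal magnitude and opposite sign map to the same value."""
    magnitudes = np.abs(np.asarray(inner_products, dtype=float))
    return positive_side(magnitudes)


# Hypothetical inner products for one category.
inner = np.array([-0.5, 0.5, 0.2, -1.0])
converted_b = symmetric_convert(inner, func_b)
converted_c = symmetric_convert(inner, func_c)
```

With either function, the first two sample values (-0.5 and 0.5) are converted to the same output, which is the positive-negative symmetry the text requires.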

After the inner products of an input pattern and the
orthogonal axes are nonlinearly converted by the
symmetrical nonlinear function, these values G=(S1, S2, . . .
, SM) may be input to a statistical discriminant function
in place of the neural network. In this case, for
example, a Bayes decision rule given by the following
equation is utilized to discriminate the input pattern.

L(K) = \frac{\exp\{-\frac{1}{2}(G - \mu_K)^T R_K^{-1} (G - \mu_K)\}}{(2\pi)^{M/2} \, |R_K|^{1/2}} \qquad (2)


where μ_K and R_K are respectively the mean vector and
the covariance matrix of an input pattern G obtained
from learning data. T indicates transposition, and
R_K^{-1} and |R_K| are respectively the inverse matrix of
the covariance matrix R_K and the value of the
determinant.

In the Bayes decision, values given by equation (2)
are computed in units of categories, and a category
providing a maximum value is discriminated as a
recognition result. When a Mahalanobis distance is used as a
simplified method of the Bayes decision, pattern
recognition processing may be performed according to the
following equation:

M(K) = (G - \mu_K)^T R_K^{-1} (G - \mu_K) \qquad (3)


When the statistical discriminant function is used in 
place of the neural network, the same effect can be 
obtained. 
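
The two statistical decision rules can be sketched as follows, assuming a Gaussian density with a per-category mean vector and covariance matrix, as the definitions above describe. The sketch evaluates the logarithm of the density rather than the density itself (a substitution made here for numerical stability; it does not change which category gives the maximum). The category names, parameter values, and function names are hypothetical.

```python
import numpy as np


def log_bayes_score(g, mean, cov):
    """Logarithm of a Gaussian density with the given mean and covariance."""
    d = g - mean
    m = g.shape[0]
    maha = d @ np.linalg.solve(cov, d)        # (G - mu)^T R^-1 (G - mu)
    _, logdet = np.linalg.slogdet(cov)        # log |R|
    return -0.5 * maha - 0.5 * m * np.log(2.0 * np.pi) - 0.5 * logdet


def mahalanobis(g, mean, cov):
    """Squared Mahalanobis distance (G - mu)^T R^-1 (G - mu)."""
    d = g - mean
    return float(d @ np.linalg.solve(cov, d))


# Hypothetical per-category parameters (mean vector, covariance matrix).
params = {
    "cho": (np.array([0.8, 0.1]), np.array([[0.05, 0.0], [0.0, 0.05]])),
    "sho": (np.array([0.2, 0.7]), np.array([[0.05, 0.0], [0.0, 0.05]])),
}
g = np.array([0.75, 0.2])  # nonlinearly converted values G

# Bayes decision: maximum score; Mahalanobis decision: minimum distance.
best_bayes = max(params, key=lambda k: log_bayes_score(g, *params[k]))
best_maha = min(params, key=lambda k: mahalanobis(g, *params[k]))
```

Because the two hypothetical categories share the same covariance matrix, both rules select the same category in this example; they can differ when the determinants differ.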

FIG. 5 is a block diagram when pattern recognition
of the present invention is applied to speech
recognition.

Input speech is input to a feature extracting unit 11.
The feature extracting unit 11 samples the input speech
at 12 kHz, converts it into 12-bit digital data, and
computes its power and LPC analysis parameters. This
computation processing is performed under conditions that
a window length is 24 msec, a frame period is 8 msec, an
order of LPC analysis is 16, and the number of terms of
the LPC mel-cepstrum is 16. Thereafter, the feature
extracting unit 11 detects an input speech interval, and
extracts power and LPC analysis parameters in this
interval as a speech feature (input pattern). The unit 11
then supplies the extracted feature to an inner product
computing unit 12.
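
The analysis conditions stated above translate into sample counts as in the short calculation below. The sampling rate, window length, and frame period come from the text; the rule that only complete windows are counted (no padding at the end of the signal) is an assumption of this sketch.

```python
SAMPLE_RATE_HZ = 12_000   # sampling frequency from the text
WINDOW_MS = 24            # analysis window length from the text
FRAME_PERIOD_MS = 8       # frame period from the text

window_samples = SAMPLE_RATE_HZ * WINDOW_MS // 1000     # samples per window
hop_samples = SAMPLE_RATE_HZ * FRAME_PERIOD_MS // 1000  # samples between frames


def num_frames(num_samples):
    """Number of complete analysis windows (assumes no padding)."""
    if num_samples < window_samples:
        return 0
    return 1 + (num_samples - window_samples) // hop_samples


frames_in_one_second = num_frames(SAMPLE_RATE_HZ)
```

Each window therefore spans 288 samples and successive windows start 96 samples apart, so adjacent windows overlap by two thirds.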

The inner product computing unit 12 executes inner 
product computations of the input pattern obtained as 
described above and orthogonalized dictionary sets 
which are prepared in units of categories to be recog- 
nized and are stored in an orthogonalized dictionary 
memory 13. The inner products between the input pat- 
tern and the orthogonalized dictionary sets are supplied 
to a symmetrical nonlinear converting unit 14 and a 
similarity computing unit 15 based on the partial space 
method.

The symmetrical nonlinear converting unit 14 nonlin- 
early converts the inner products in accordance with 
the positive-negative symmetrical nonlinear function 
shown in FIG. 4, and supplies the converted values to a 
neural network unit 17. The similarity computing unit 
15 executes similarity computation processing by the 
subspace method given by equation (1) for the inner 
products. The similarity computation result obtained by 
the similarity computing unit 15 is supplied to a prede- 
termining section 16. The predetermining section 16 
determines whether the value from the similarity com- 
puting unit 15 is to be output as a recognition result or 
is to be discriminated by the neural network unit 17. 
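
A minimal sketch of the inner product computation and the subspace similarity is given below. It assumes that equation (1) is the sum of squared inner products divided by the squared norm of the input pattern, and that each dictionary is stored as an array whose rows are orthonormal axes; the dictionary contents and names are hypothetical.

```python
import numpy as np


def inner_products(f, axes):
    """Inner products of input pattern f with each orthogonal axis.

    axes is an (M, D) array whose rows are the M orthonormal axes
    of one category's dictionary.
    """
    return axes @ f


def subspace_similarity(f, axes):
    """Sum of squared inner products, normalized by the squared norm of f."""
    p = inner_products(f, axes)
    return float(np.sum(p ** 2) / np.dot(f, f))


# Hypothetical orthogonalized dictionaries in a 3-dimensional feature space.
dictionaries = {
    "A": np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]),
    "B": np.array([[0.0, 0.0, 1.0]]),
}
f = np.array([3.0, 4.0, 0.0])  # input pattern

scores = {name: subspace_similarity(f, axes) for name, axes in dictionaries.items()}
```

In this example the input pattern lies entirely in the subspace of category A, so its similarity to A is 1 and its similarity to B is 0.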

The neural network is provided with coefficients 
shown in FIG. 5 in the same manner as in the orthogo- 
nalized dictionary sets. These coefficients are computed 
in advance in units of categories to be recognized on the 
basis of learning data using the above-mentioned BP 
algorithm. More specifically, after the learning pattern 
is nonlinearly converted as described above, the weight 
coefficients are computed based on the Back Propagation
(BP) algorithm in such a way that G=(S1, S2, . . . ,
SM) is used as the input pattern of a fully connected
multi-layer neural network which outputs 0.9 and 0.1 if 
the learning pattern is true and false, respectively. In 
this embodiment, the neural network is a multi-layer 
neural network each layer having a plurality of ele- 
ments and each element including a multiplier for multi- 
plying an input signal from a lower adjacent layer by a 
preset weight coefficient and connected to all the ele- 
ments constituting the upper adjacent layer. 

The neural network unit 17 is provided in units of 
categories, as shown in FIG. 5, and receives the nonlin- 
early converted values G (=(S1, S2, . . . , SM); M is the 
number of orthogonal axes) to compute sums of prod- 
ucts with the coefficients along connecting lines shown 
in FIG. 5. The unit 17 combines the sums in units of 
nodes, and executes the nonlinear conversion based on 
the sigmoid function to obtain signals y1, y2, . . . , yL.
Similarly, the unit 17 computes sums of products be- 
tween these signals and the coefficients, and executes 
nonlinear conversion. The unit 17 then combines the
converted values to obtain an output associated with a 
given category. Of neural network outputs for all the 
categories obtained in this manner, a category which 
provides a maximum output is discriminated as a recog- 
nition result, thereby recognizing the input pattern. 
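
The per-category computation described above can be sketched as a small forward pass. The sketch assumes one hidden layer of L nodes and a single output node per category, with no bias terms (the text mentions only multipliers and sums); the weights here are random placeholders standing in for coefficients that would be obtained by BP training.

```python
import numpy as np


def sigmoid(x):
    """Sigmoid function f(x) = 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + np.exp(-x))


def category_output(g, w_hidden, w_out):
    """Forward pass for one category's network.

    g        : (M,) nonlinearly converted inner products
    w_hidden : (L, M) coefficients between input and hidden nodes
    w_out    : (L,) coefficients between hidden nodes and the output
    """
    y = sigmoid(w_hidden @ g)          # signals y1, y2, ..., yL
    return float(sigmoid(w_out @ y))   # output associated with the category


# Placeholder coefficients; in the described system they are preset by training.
rng = np.random.default_rng(0)
M, L = 4, 3
networks = {
    name: (rng.normal(size=(L, M)), rng.normal(size=L))
    for name in ("cho", "sho", "ka")
}

g = np.array([0.25, 0.04, 1.0, 0.09])
outputs = {name: category_output(g, wh, wo) for name, (wh, wo) in networks.items()}

# The category with the maximum output is taken as the recognition result.
recognized = max(outputs, key=outputs.get)
```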

FIG. 6 shows a neural network of the speech recogni- 
tion system shown in FIG. 5. The same reference nu- 
merals in the neural network in FIG. 6 denote the same 
parts as in FIG. 5. 

Input patterns output from the feature extracting unit
11 are subjected to an inner product computation
(F·φ(K,m)) given by equation (1) and are added by the
inner product computing unit 12. The sum output values
from the inner product computing unit 12 as the
inner products are nonlinearly converted by the symmetrical
nonlinear converting unit 14. The neural network
unit 17 further computes inner products of
weighting coefficients and the output values y1^(1), y2^(1),
. . . , yM^(1) from the symmetrical nonlinear converting unit
14 in units of categories, computes a total sum of the
inner products, and then nonlinearly converts the total
sum according to the sigmoid function, thus obtaining
an output S_N^(K).

The similarity computing unit 15 computes a total
sum of values obtained by multiplying the output values
y1^(1), y2^(1), . . . , yM^(1) from the symmetrical nonlinear
converting unit 14 with "1" and values obtained by
multiplying the output values y1^(1), y2^(1), . . . , yM^(1) with
"2", thus obtaining an output S_S^(K).

S_S^(K) corresponds to a score of the subspace method
alone. S_N^(K) can be regarded as a score of the neural
network alone when projective components to each
subspace are input to the neural network. A determining
section 18 performs primary determination based on
higher-order scores (S_S1, S_S2, . . .) of S_S^(K). As a result,
when a plurality of candidates are obtained, the
recognition result is determined based on S_N^(K).

In this embodiment, neural network computations for 
all the categories are performed for an input. However, 
in a practical application, certain recognition candidates 
may be selected by utilizing the output from the prede- 
termining section 16, and only higher-order candidates 
may be subjected to computations by the neural network
unit 17. The number of layers of the neural network
may be changed within the spirit and scope of the
invention. 
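
The candidate preselection described above might be organized as in the sketch below: the subspace scores are ranked first, and the neural network score is consulted only when more than one candidate survives. The text does not state how candidates are selected, so the `margin` and `top_n` parameters, the function name, and the sample scores are all hypothetical.

```python
def decide(subspace_scores, nn_score_fn, margin=0.05, top_n=3):
    """Two-stage decision: subspace scores first, neural network second.

    subspace_scores : dict mapping category -> subspace similarity
    nn_score_fn     : callable returning the neural network score of a category
    margin, top_n   : hypothetical selection parameters (not from the text)
    """
    ranked = sorted(subspace_scores, key=subspace_scores.get, reverse=True)
    best = ranked[0]
    # Keep higher-order candidates whose score is close to the best one.
    candidates = [
        k for k in ranked[:top_n]
        if subspace_scores[best] - subspace_scores[k] <= margin
    ]
    if len(candidates) == 1:
        return best                      # primary determination suffices
    return max(candidates, key=nn_score_fn)


# Hypothetical scores for three categories.
subspace = {"cho": 0.91, "sho": 0.90, "ka": 0.40}
nn = {"cho": 0.2, "sho": 0.8, "ka": 0.9}

ambiguous_result = decide(subspace, nn.get)
clear_result = decide({"cho": 0.95, "sho": 0.50}, nn.get)
```

In the first call two candidates remain after the primary determination, so the neural network score decides between them; in the second call only one candidate remains and the network is not consulted.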

In this manner, in the pattern recognition system and
method of the present invention, inner products between
an input pattern and the orthogonalized dictionary
sets are computed and are nonlinearly converted,
and the converted values are supplied to the neural
network unit 17 to execute pattern recognition processing.
According to the system and method of the present
invention, accurate recognition processing can be realized
by the inner product computations and nonlinear
conversion without performing complicated coefficient
computations in earlier stages in the neural network. As
a result, an input pattern can be easily and effectively
recognized with high accuracy using the neural network
in which coefficient computations are simplified,
while drawbacks of the subspace method in terms of
various noise components are effectively eliminated.

FIG. 7 is a block diagram showing the second embodiment
of a pattern recognition system according to
the present invention. Note that the same reference
numerals in FIG. 7 denote the same parts as in FIG. 5,
and a detailed description thereof will be omitted. In
this embodiment, pattern recognition processing is executed
using the statistical discriminant function described
above in place of the neural network. In this
embodiment, input speech is analyzed by a feature extracting
unit 11 as in the first embodiment to obtain its
power and LPC analysis parameter, and a speech interval
is detected to extract a speech feature (input pattern).
An inner product computing section 12 computes
inner products between the input pattern and orthogonalized
dictionaries prepared in units of categories and
stored in an orthogonalized dictionary memory 13. The
inner products are nonlinearly converted by a symmetrical
nonlinear converting unit 14. Thereafter, a statistical
discriminant function computing unit 19 performs
predetermined coefficient computations using the
nonlinearly converted values.

The statistical discriminant function computing unit 
19 executes coefficient computation processing by re- 
ferring to a coefficient registering unit 20 which stores 
mean vectors and covariance matrices obtained from 
learning data in units of categories in the same manner 
as the orthogonalized dictionary sets. 

More specifically, the statistical discriminant function 
computing unit 19 receives the nonlinearly converted 
values G (=(S1, S2, . . . , SM); M is the number of 
orthogonal axes), and executes coefficient computation 
processing in accordance with equation (2) or (3) between
the inputs and coefficients given from the coefficient
registering unit 20, thus obtaining evaluation values.
Of the evaluation values (outputs) computed in this
manner, a category which provides a maximum output
of all the categories in the Bayes decision (a minimum
value for the Mahalanobis distance) is discriminated as
a recognition result for the input pattern, thus
recognizing the input pattern.
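
The contents of the coefficient registering unit 20 could be prepared from learning data as in the following sketch, which estimates a mean vector and a covariance matrix for each category. The use of the unbiased (N-1) covariance estimate, the function name, and the sample data are assumptions; the text states only that the values are obtained from learning data.

```python
import numpy as np


def register_coefficients(learning_data):
    """Estimate a mean vector and covariance matrix per category.

    learning_data maps each category to an (N, M) array-like of
    nonlinearly converted vectors G obtained from learning patterns.
    """
    table = {}
    for category, samples in learning_data.items():
        samples = np.asarray(samples, dtype=float)
        mean = samples.mean(axis=0)
        # rowvar=False: each row is one observation, each column one axis.
        cov = np.cov(samples, rowvar=False)
        table[category] = (mean, cov)
    return table


# Hypothetical learning data: three converted vectors per category.
learning = {
    "cho": [[1.0, 0.0], [3.0, 0.0], [2.0, 3.0]],
    "sho": [[0.0, 1.0], [0.0, 2.0], [3.0, 3.0]],
}
coefficients = register_coefficients(learning)
cho_mean, cho_cov = coefficients["cho"]
```

In practice enough learning patterns are needed per category for the covariance matrix to be invertible, since both decision rules use its inverse.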

In this case, certain candidates may be selected by 
utilizing an output from a predetermining section like in 
the first embodiment without computing the statistical 
discriminant function for all the categories, and higher- 
order candidates may be subjected to discriminant func- 
tion computations. 

The present invention is not limited to the above 
embodiments. 

For example, the number of axes of orthogonalized 
dictionaries used in inner product computations, the 
number of categories to be recognized, and the number 
of layers constituting the neural network may be deter- 
mined in accordance with specifications of pattern rec- 
ognition. The nonlinear function for nonlinearly con- 
verting inner products is not limited to the one de- 
scribed above. 

What is claimed is: 

1. A speech pattern recognition system comprising: 

speech input means for inputting a speech pattern to 
be recognized; 

feature extracting means for extracting feature vec- 
tors from the input speech pattern; 

dictionary means for storing a plurality of orthogonalized
reference patterns, which are equal to eigen- 
vectors calculated using Karhunen-Loeve (KL) 
expansion, and whose categories are known; 

inner product computing means, provided for each 
category, for computing inner products of the 
input speech pattern, using said feature vectors, 
whose category is unknown and the orthogonal- 
ized reference patterns of a category which are 
stored in said dictionary means and whose category
is known;

converting means, provided for each category, for 
nonlinearly converting the inner product of a cate- 
gory, which is computed by said inner product 
computing means, in accordance with a positive-negative
symmetrical nonlinear function;

evaluation value computing means, provided for each 
category and each evaluation value computing 
means including a multi-layer neural network 
whose elements each include multiplier means for 
multiplying an input signal from a lower adjacent 
layer by a preset weight coefficient and being connected
to all elements constituting an upper adjacent
layer, for computing an evaluation value of a
category using the nonlinearly converted values
calculated by said converting means and weight
coefficients which are preset in said neural network
of a category based on a discriminatively trained
Back Propagation (BP) algorithm;

comparing means for comparing the evaluation values
obtained by said evaluation value computing
means using said neural networks with other
evaluation values; and

discriminating means for outputting a category to
which the input speech pattern belongs as a
recognition result using comparison results of said
comparing means.

2. A computer-implemented speech pattern recognition
method comprising the steps of:

inputting a speech pattern to be recognized;

extracting feature vectors from the input speech;

computing inner products of the input pattern, using
said feature vectors, whose category is unknown
and orthogonalized reference patterns of a category
which are equal to eigenvectors calculated
using Karhunen-Loeve (KL) expansion, and whose
category is known;

nonlinearly converting the inner product of a category,
which is computed in the computing step, in
accordance with a positive-negative symmetrical
nonlinear function;

computing an evaluation value from the nonlinearly
converted values converted by the converting step
and weight coefficients using a multiple neural
network whose elements each include multiplier
means for multiplying an input signal from a lower
adjacent layer by a preset weight coefficients and
connected to all elements constituting an upper
adjacent layer in which said weight coefficients are
preset for each category based on a discriminatively
trained Back Propagation (BP) algorithm;

comparing in the evaluation values obtained from
said neural networks in the evaluation value computing
step of said categories with other evaluation
values; and

discriminating and outputting a category to which
the input pattern belongs as a recognition result
from comparison results of the comparing step.

3. A speech pattern recognition system, comprising:

speech input means for inputting a speech to be
recognized;

feature extracting means for extracting feature vectors
from the input speech;

multi-layer neural networks whose elements each
include multiplier means for multiplying an input
signal from a lower adjacent layer by a preset
weight coefficient and connected to all elements
constituting an upper adjacent layer and, respectively
provided for categories of speech, for receiving
the feature vectors as input, and for outputting
evaluation values for each category, each of said
neural networks having, as weight coefficients of a
hidden layer, the weight coefficients being computed
based on a discriminatively trained Back
Propagation (BP) algorithm, eigenvectors calculated
in advance by Karhunen-Loeve (KL) expansion
from known feature vectors belonging to one
of said categories, a nonlinear function of an output
of the hidden layer being a positive-negative
symmetrical nonlinear function; and

determining means for determining a category as a
recognition result from evaluation values output
from the multi-layer neural networks.

4. A speech pattern recognition system as recited in
claim 1, wherein said multi-layer neural network
comprises:

second inner product computing means for computing
second inner products using said nonlinearly
converted inner products of a category from said
converting means and said preset weight coefficients;

second converting means for nonlinearly converting
said second inner products computed by said second
inner product computing means;

means for obtaining a sum of the nonlinearly converted
second inner products from said second
converting means; and

third converting means for nonlinearly converting
the sum of the nonlinearly converted second inner
products, thereby producing said evaluation value.

* * * * *